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Abstract 

A link stream is a collection of triplets (f, it, v) indicating that an interaction 
occurred between u and v at time t. We generalize the classical notion of cliques 
in graphs to such link streams: for a given A, a A-clique is a set of nodes and 
a time interval such that all pairs of nodes in this set interact at least once 
during each sub-interval of duration A. We propose an algorithm to enumerate 
all maximal (in terms of nodes or time interval) cliques of a link stream, and 
illustrate its practical relevance on a real-world contact trace. 

Keywords: link streams, temporal networks, time-varying graphs, cliques, 
graphs, algorithms 


1. Introduction 


In a graph G = (V, E) with E C V x V, a clique C C V is a set of nodes such 
that C xC C E. In addition, C is maximal if it is included in no other clique. In 
other words, a maximal clique is a set of nodes such that all possible links exist 
between them, and there is no other node linked to all of them. Enumerating 
maximal cliques of a graph is one of the most fundamental problems in computer 
science, and it has many applications 1,113. 

A link stream L = (T, V, E) with T = [a, w] and E C TxVxV models inter¬ 
actions over time: l = ( t , u, v ) in E means that an interaction occurred between 
u £ V and v £ V at time t £ T. Link streams, also called temporal networks or 
time-varying graphs depending on the context, model many real-world data like 
contacts between individuals, email exchanges, or network traffic 0! QQ. 

For a given duration A, a A-clique C of L is a pair C = (X, [b, e]) with X C V 
and [6, e]CT such that \X\ > 2, and for all {it, v} C X and r £ [6, max(e —A, b )] 
there is a link (t,u, v) in E with t £ [r, min(r -f A, e)]. Notice that A-cliques 
necessarily have at least two nodes. 

More intuitively, all nodes in X interact at least once with all others at least 
every A from time b to time e. A-clique C is maximal if it is included in no 
other A-clique, (i.e. there exists no A-clique C' = (A'', [&', e']) such that C' ^ C, 
X C X' and [b, e] C [b',e']). See Figure |T] for an example. 

In real-world situations like the ones cited above, A-cliques are signatures of 
meetings, discussions, or distributed applications for instance. Moreover, just 
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Figure 1: Examples of A-cliques. We consider the link stream L = 
([0, 9], {a, b, c}, E) with E = {(3, a, b), (4, b, c), (5, a, c), (6, a, b)} and A = 3. 
There are four maximal 3-cliques in L: ({a, b}, [0,9]) (top left), ({a, b, c}, [2, 7]) 
(top right), ({&, c}, [1, 7]) (bottom left), and ({a, c}, [2, 8]) (bottom right). Notice 
that ({a, b, c}, [1, 7]) is not a A-clique since during time interval [1,4] of duration 
A = 3 there is no interaction between a and c. Notice also that ({a, b}, [1, 9]), 
for instance, is not maximal: it is included in ({a, b}, [0,9]). 


like cliques in a graph correspond to its subgraphs of density 1, A-cliques in 
a link stream correspond to its substreams of A-density 1, as defined in 12], 


Therefore, A-cliques in link streams are natural generalizations of cliques in 
graphs. 

In this paper, we propose the first algorithm for listing all maximal A-cliques 
of a given link stream. We illustrate the relevance of the concept and algorithm 
by computing maximal A-cliques of a real-world dataset. 

Before entering in the core of the presentation, notice that we consider here 
undirected links only: given a link stream L = (T, V. E), we make no distinction 
between ( t , u,v) £ E and ( t , v, u) £ E. Likewise, we suppose that there is no 
loop (t,v,v) in E , and no isolated node (Vu £ V, 3 (t,u,v) £ E). 

We finally define the first occurrence time of (w, v) after b as the smallest 
time t > b such that (t,u,v) £ E , and we denote it by ff mv ■ Conversely we 
denote the last occurrence time of (u, v) before e by l eU v We say that a link 
(t, u , v) is in C = (X, [b, e]) if u £ X, v £ X and t £ [6, e]. 


2. Algorithm 

One may trivially enumerate all maximal cliques in a graph as follows. One 
maintains a set M of previously found cliques (maximal or not), as well as a set 
S of candidate cliques. Then for each clique C in S , one removes C from S and 
searches for nodes outside C connected to all nodes in clique C, thus obtaining 
new cliques (one for each such node) larger than C. If one finds no such node, 
then clique C is maximal and it is part of the output. Otherwise, if the newly 
found cliques have not already been found (i.e., they do not belong to M), then 
one adds them to S and M. The set S is initialized with the trivial cliques 
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containing only one node, and all maximal cliques have been found when S is 
empty. The set M is used for memorization, and ensures that one does not 
examine the same clique more than once. In |7| the authors use this framework 
to enumerate all maximal cliques of a graph in lexicographic order. 

Our algorithm for finding A-cliques in link stream L = (T, V, E ) (Algo¬ 
rithm!]]) relies on the same scheme. We initialize the set S of candidate A-cliques 
and the set M of all found A-cliques with the trivial A-cliques ({a, b}, [t, t ]) for 
all ( t,a,b ) in E (Line [2]). Then, until S is empty ( while loop of Lines [ill to EH) , 
we pick an element (.X, [6, e]) in S (Line []} and search for nodes v outside X 
such that ( X U {i>}, [ b , e]) is a A-clique (Lines 151 to fill. We also look for a value 
b' < b such that (X, [b',e]) is a A-clique (Tines ITT1 to fl6l). and likewise a value 
e' > e such that ( X , [b, e']) is a A-clique (Lines [TT] to [22j). If we find such a node, 
such a b' or such an e! , then A-clique C is not maximal and we add to S and 
M the new A-cliques larger than C we just found Tines flTTl fl6l and EH . on the 
condition that they had not already been seen (i.e., they do not belong to M). 
Otherwise, C is maximal and is part of the output (Line EH . 


Algorithm 1 Maximal A-cliques of a link stream 


S i- 0, R <- 0, M 4- 0 

for (■ t,u,v ) € E: add ({u,u}, [t, t]) to S and to M 

while 5^0 do 

take and remove (X, [b,e]) from S 
set isMax to True 
for v in V \ X do 

if (X U {v}, [b, e]) is a A-clique then 
set isMax to False 
if (X U {c}, [b, e]) not in M then 
| add (X U {z>}, [6, e]) to S and M 

f 4 — max,, v ^x fbuv > latest first occurrence time of a link in ( X , [6, e]) 

set b' to / — A 
if b ^ b' then 

set isMax to False 
if (X, [b',e]) not in M then 
( add (X, [b',e]) to S and M 

l 4 — min,, v qx leuv t> earliest last occurrence time of a link in (X, \b,e]) 

set e' to l + A 
if e/c' then 

set isMax to False 
if (X, [b , e']) not in M then 
| add (X, [b, e'] ) to S and M 
if isMax then 
j add (X, [b, e]) to R 
return R 
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Let us explain the choice of b' (Lines |TT] to [TOT) in details, the choice of e' 
(Lines [19] to [22j being symmetrical. For a given A-clique (A, [b, e}). we set b' 
to / — A, which is the smallest time such that we are sure that (X , [b',e]) is a 
A-clique without inspecting any link outside of (X, [6, e]). Indeed, all links in 
A' x X appear at least once in the interval [/ — A, /]: / is the latest of the first 
occurrence times of all links in this A-clique, and so all links appear at least 
once in [b, /] C [/ — A, /]. lib' then the A-clique (A', [b r , e]) is added to S 
(LineflSl). 

We display in Figure [2] an example of a sequence of such operations from an 
initial trivial A-clique to a maximal A-clique in an illustrative link stream. The 
algorithm builds this way a set of A-cliques of L , which we call the configuration 
space; we display the configuration space for this simple example in Figure [3] 
together with the relations induced by the algorithm between these A-cliques. 
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Figure 2: A sequence of A-cliques built by our algorithm to find a maximal 
A-clique (bottom-right) from an initial trivial A-clique (top-left) in the link 
stream of Figure [T] when A = 3. 

From left to right and top to bottom: the algorithm starts with ({a, b}, [6,6]), 
and finds ({a, &}, [3, 6]) thanks to Lines [11] to [TH] of the algorithm. It then finds 
({a, b , c}, [3,6]) thanks to Lines [6] to [10] It finds ({a, b , c}, [3, 7]) from Lines [IT] 
to [22l and finally ({a, &, c}, [2, 7]) from Lines fill to fl6l 

To prove the validity of Algorithm |I] we must show that all the elements it 
outputs are A-cliques, that they are maximal, and that all maximal A-cliques 
are in its output. 

Lemma 1 . In Algorithm |7J all elements of S are A-cliques of L. 

Proof. We prove the claim by induction on the iterations of the while loop 
(Lines l3l to 1771) . 

Initially, all elements of S are A-cliques (Line [2J . Let us assume that all 
the elements of S are A-cliques at the i -th iteration of the loop (induction 
hypothesis). The loop may add new elements to S at Lines fltilfl6l and l22l In all 
cases, the added element is built from an element C = (A, [b, e]) of S (Line [4]), 
which is a A-clique by induction hypothesis. 

It is trivial (from the test at Line [7]) that Line [TU] only adds A-cliques. 
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Figure 3: The configuration space built by our algorithm from the link stream of 
Figures [l]and [2] when A = 3. Each element is a A-clique and it is linked to the 
A-cliques the algorithm builds from it (links are implicitly directed from top to 
bottom). Plain links indicate A-cliques discovered by Lines [TT] to [16] or Lines IT71 
to [22] of the algorithm, which change the time span of the clique. Dotted links 
indicate A-cliques discovered by Lines E3 to m which change the set of nodes 
involved in the clique. The bold path is the one detailed in Figure [2] Colors 
correspond to the maximal A-cliques displayed in Figure [T} 


Let us show that (X, [b, l + A]), where l is computed in Line[l71 necessarily 
is a A-clique. As (X, [6, e]) is a A-clique all links in X x X appear at least 
once every A from b to l < e. Moreover, since l is the earliest last occurrence 
time of a link in C, for all u and v in X there is necessarily a link (f, it, v) in E 
with l < t < e. Notice also that l > e — A, otherwise (X, [6, e]) would not be a 
A-clique. Therefore a link between u and v occurs at least once between l and 
l + A for all u and v in X. Finally, (X, [6, l + A]) is a A-clique. 

The same arguments hold for LinefTTl 

Finally, at the end of the (i + l)-th iteration of the loop, all the elements of 
S are A-cliques, which ends the proof. □ 

Lemma 2. All the elements of the set returned by Algorithm [7] are maximal 
A-cliques of L. 

Proof. Let C = (X, [6, e]) be an element of R returned by the algorithm. Only 
elements of S are added to R (at Line [53}, and so according to Lemma Q] C is 
a A-clique. Assume it is not maximal; then we are in one of the three following 
situations. 
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There exists v in V\X such that (IU{v}, [ b , e]) is a A-clique. Then v is found 
at Lines EH3 and Line [8] sets the boolean isMax to false. Therefore, Line [23] 
ensures that C = ( X , [b, e]) is not added to R, and we reach a contradiction. 

There exists e! > e such that (A, [6, e'\) is a A-clique and we assume without 
loss of generality that there is no link between nodes in X from e to e'. Then, let 
us consider l £ [b, e], computed in Line 1171 which is the earliest last occurrence 
time of a link in C. We necessarily have l > e' — A because (X, [b,e'}) is a 
A-clique. Since e' > e, this implies l > e — A. As a consequence, the test in 
Line [Hi] of the algorithm is satisfied, and Line [212] sets the boolean isMax to false. 
Like above, we reach a contradiction. 

If there exists b' < b such that ( X , [b r , e]) is a A-clique, then similarly to the 
previous case we reach a contradiction. 

Finally, C necessarily is maximal, which proves the claim. □ 

Before proving our main result, which is that all maximal A-cliques are 
returned by the algorithm, we need the following two intermediate results. 

Lemma 3. Let C = (A, [6, e]) be a maximal A-clique of L, and let s be the 
earliest occurrence time of a link in C. Then e > s + A. 

Proof. Since C is a A-clique and by definition of s, for all u,v in A there exists 
at least one link ( t,u,v ) such that s < t < e. Assume e < s + A; then for all 
u , v in A there also exists a link (t, u , v) such that s<f<e<s + A. Therefore 
(A, [6, s + A]) is a A-clique and C is included in it, which means that C is not 
maximal and we reach a contradiction. □ 

Lemma 4. Let C = (A, [6, e]) be a maximal A-clique of L and let s be the 
earliest occurrence time of a link in C. If (A, [s, s + A]) is in S at some stage 
of Algorithm^ then C is in the set returned by the algorithm. 

Proof. Assume Cq = (A, [s,s + A]) is in S and consider the longest sequence 
of steps of Algorithm Q] of the form: Cq —> C\ Cf such that for all i 

Ci = (A, [s, e*]) with e^+i > e*. In other words, the algorithm builds Cj+i from 
Ci in Lines [IT] to [22] (notice that e > s +A from Lemma [3] and so Co is included 
in C). 

We prove that = (A, [s, e]) by contradiction. Assume this is false, and 
so that efc ^ e. As C is maximal, we then necessarily have e*, < e. In addition, 
efc = l + A where l is the earliest last occurrence time of a link in Ck-i computed 
at LineUZl Since Ck is the last A-clique in the sequence, l is also the earliest last 
occurrence time of a link in Ck (otherwise there would be a clique Ck+i satisfying 
the constraints of the sequence above). Therefore there exist u, v £ X such that 
(. l,u,v ) £ E and such that there is no occurrence of a link (it,u) between l and 
efc = l + A. This ensures that there exists an e such that l + A + e < e and such 
that there is no link between u and v from l + e to / + A + e, which contradicts 
the assumption that C is a A-clique. 

We now show that the algorithm builds C from Ck to end the proof. Since 
C is maximal, there exists u,v £ X such that (b + A, u, v) £ E and such that 
there is no other link between u and v from & to b + A. By definition of s, 
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b + A > s. Therefore the latest first occurrence time of a link in Ck, /, is equal 
to b + A and Lines |TT] to [16] build C from Ck- □ 

Lemma 5. All maximal A-cliques of L are in the set returned by Algorithm QJ 

Proof. It is easy to check that if S contains a maximal A-clique then it is added 
to the set R returned by the algorithm, and only these A-cliques are added to 
R. We therefore show that all maximal A-cliques are in S at some stage. 

Let C = (A', [ b , e]) be a maximal A-clique of L , let s be the earliest occurrence 
time of a link in C, and let u,v £ X be two nodes such that there exists a link 
between them at s (i.e., ( s,u,v ) £ E). We show that there is a sequence of 
steps of the algorithm that builds C from A-clique Cq = ({u,u}, [s, s]) (which 
is placed in S at the beginning of the algorithm, Line EJ)- 
Lines Q7] to [22] builds C\ = ({u,v}, [s, s + A]) from Cq. 

Notice that for all subset Y of X, (Y, [s, s + A]) is a A-clique. Therefore the 
algorithm iteratively adds all elements of X at Lines [5] to [101 finally obtaining 
C' = (A, [s,s + A]) from C x . 

We finally apply Lemma [I] to conclude that the algorithm builds C from C'. 

□ 


From these lemmas, we finally obtain the following result. 

Theorem 1. Given a link stream L and a duration A, Algorithm [7] computes 
the set of all maximal A-cliques of L. 

In order to investigate the complexity of our algorithm, let us denote by 
n = |V| the number of nodes and m = |i£| the number of links in L. First notice 
that the number of elements in the configuration space built by the algorithm 
is bounded by the number of subsets of V times the number of sub-intervals of 

T. 

Moreover, for all A-clique C = (A, [fo, e]) in the configuration space, there 
exists a link ( b,u,v ) or a link (b + A ,u,v) in E. Indeed, the initial trivial A- 
cliques are in the first case, and all A-cliques obtained from them are also in this 
case until Line m is applied. The A-cliques built after this are in the second 
case. Likewise, there exists a link (e, u, v ) or a link (e — A, u, v ) in E. Therefore, 
the number of possible values for b and e for any A-clique in the configuration 
space is proportional to the number of time instants at which a link occurs, 
which is bounded by the number of links to. The number of sub-intervals of T 
corresponding to a A-clique in the configuration space is therefore in 0(m 2 ). 
This bound is reached in the worst case, for instance if the stream is a sequence 
of links occurring once every A time interval. 

The trivial bound 0(2 n ) for the number of subsets of V is also reached in 
the worst case, for instance if there is a link between all pairs of nodes at the 
same time: the algorithm will enumerate all subsets of V. 

Therefore, the number of elements in the configuration space is in 0(2 n m 2 ). 
This leads to the space complexity of our algorithm: it is proportional to the 
space needed to store the configuration space, which is in 0(2 n nm 2 ) since each 
element may be stored in 0(n) space. 
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We estimate the time complexity by studying the complexity of operations 
performed on each element of the configuration space, (i.e., the complexity of 
each iteration of the while loop at Lines [3] to [Ml) . Let us consider a A-clique 
C = (A', [ b , e]) picked from S by the algorithm at Line [I] 

The while loop is composed of three blocks: (1) searching for A-cliques of 
the form (.X U {u}, [6, e]) larger than C (Lines iGl to [TUI) : (2) searching for a 
A-clique (A, [6',e]) larger than C (Lines [TT| to [16]); and (3) searching for a A- 
clique (X, [£>, e']) larger than C (Lines [T71 to [Ml) . The third block has the same 
complexity as the second one, so we focus on the time complexity of the two 
first blocks. 

Given a node v ^ X, Line [7] tests whether for all nodes u in X there is a link 
( t,v,u ) £ E in each time interval of duration A. This requires at most |X| • m 
tests, and so it is in 0(nm). Then, Line [9] searches for the found A-clique in 
M, which has a size in 0(2 n m 2 ). Since the comparison between two A-cliques 
can be performed in 0(n) time, this search therefore is in 0(n log(2 n m 2 )) = 
0(n 2 + ?rlogm) time. The algorithm repeats these operations for all nodes 
v £ V \ X, and thus less than n times, hence the complexity of Lines 151 to [TUI is 
in 0(n(nm + n 2 + n log to)) = 0(n 2 mn 4- n 3 ). 

Computing / in Line if ll may clearly be done with at most m tests. Lines mi 
and [Ml are all trivial computations. Lines [T5[ and Line[in]are in 0{n 2 +nlogm). 
The complexity of Lines iTTI to ITT)1 is therefore in 0(m + n 2 + nlogm). 

Finally, each iteration of the while loop costs at most 0(n 2 m + n 3 + m + 
n 2 + nlogm ) = 0{n 2 m + n 3 ) time. We bound the overall time complexity of 
the algorithm by multiplying this by the number of iterations of the while loop, 
which is the number of elements in the configuration space. It is therefore in 
0{2 n m 2 (n 2 m + n 3 )) = 0(2 n n 2 m 3 + 2 n n 3 m 2 ). 

From this analysis, we obtain the following result: 

Theorem 2. Let L = (T, V, E ) be a link stream with | V| = n and |i?| = m, and 
let A be a duration, then Algorithm Q] computes the set of all maximal A-cliques 
of L in 0(2 n nm 2 ) space and 0(2 n n 2 m 3 + 2 n n 3 m 2 ) time. 

Notice that enumerating the maximal cliques in a graph G = (V, E ) is 
equivalent to enumerating the maximal A-cliques in L = ([0,0], V, E r ) where 
(0, u, v) £ E' if and only if (u, v) £ E. The problem of enumerating maximal 
A-cliques in a link stream is therefore at least as difficult as enumerating maxi¬ 
mal cliques in a graph, which has an exponential time complexity (in particular, 
there can be an exponential number of maximal cliques). Therefore any algo¬ 
rithm for enumerating maximal A-cliques in a link stream is at least exponential 
in the number of nodes. 

Notice also that several optimizations may speed up our algorithm (without 
changing its worst-case complexity). In particular, / and l, computed in Lines [Til 
and m are necessarily in [6, min(e, b + A)] and [max(6, e — A), e], respectively. 
One may therefore focus the search on these intervals rather than [6, e]. Likewise, 
if V(C) is the set of nodes satisfying condition of Line 0 then the set V(C') 
of nodes satisfying this condition for the A-cliques C' added to S at Lines [TUI 


m and is included in V{C). One may therefore associate to each element of 
S a set of candidate nodes to be considered at Line [6] in place of V \ X, thus 
drastically reducing the number of iterations of this loop. 

3. Experiments 

We implemented Algorithm |T| with the optimizations discussed above in 
Python (2.7) and provide the source code at [13j] . We illustrate here its prac¬ 
tical relevance by computing maximal A-cliques of the link stream from the 
Thiers-Highschool dataset, which is a trace of real-world contacts between 
individuals, captured with sensors. It was collected at a French high school in 
2012, see [5[ for full details. It induces a link stream of 181 nodes and 45, 047 
links, connecting 2,220 distinct pairs of nodes over a period of 729,500 seconds 
(approximately 8 days). Each link ( t,u,v ) means that the sensor carried by 
individual u or v detected the sensor carried by the other individual at time t, 
which means in turn that these two individuals were close enough from each 
other at time t for the detection to happen. We call this a contact between in¬ 
dividuals u and v. We also have the information of the class to which students 
belong. 

We computed all maximal A-cliques for A = 60 seconds, A = 900 seconds 
(15 minutes), A = 3,600 seconds (1 hour), and A = 10,800 seconds (3 hours). 
We handpicked these values because of the rhythm of school day: on a typical 
day, courses usually last roughly two hours, with two 15 minutes breaks during 
the day, and a longer 1 hour lunch break. Our Python implementation took an 
hour on a standard server 0 to obtain the results. Although many discovered 
A-cliques are very small, we also found rather large and long ones. See Table [T| 
for a summary of these computations. 


A (s) 

\R\ 

Max 

1*1 

Max e — b 
( s ) 

Running 
time (s) 

Memory 

(MB) 

60 

14 664 

5 

6 820 

150 

537 

900 

8 214 

7 

17 420 

555 

4 755 

3 600 

7 170 

7 

36 340 

1 080 

23 186 

10 800 

7 416 

7 

59 560 

3 100 

30 453 


Table 1: Experimental results for computing all maximal A-cliques on the 
Thiers-Highschool dataset. |i?.| is the size of the set returned by our al¬ 
gorithm, (i.e., the number of A-cliques found). For information, storing the 
dataset in RAM requires 51 MB. 


1 A Debian machine with a 2.9 GHz CPU and 64 GB of RAM. 
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We present in Figure [2 for each value of A, the complementary cumula¬ 
tive distributions for the size |X| and duration e — b of all maximal A-cliques 
(X, [6, e]). By definition, larger values of A trivially induce larger and longer 
A-cliques. Indeed, if A' > A then every (maximal) A-clique also is a A'-clique 
(not maximal in general). More intuitively, small values of A detect local bursts, 
but are unable to find periodic behaviors if the period is larger than A. Notice 
that when A grows the number of maximal A-cliques generally decreases, but 
this is not always true, as seen in Table |T] For an example of how the impact of 
A on the number of maximal A-cliques is not trivial, consider the stream pre¬ 
sented in Figure [T] it contains four maximal 1-cliques, six maximal 2-cliques, 
and four maximal 3-cliques. 



Figure 4: Left: complementary cumulative distribution of A-clique sizes for 
different values of A. Right: complementary cumulative distribution of A- 
clique durations for different values of A. The sharp drop at 2 ■ A is due to 
A-cliques involving only one link. 


Notice now that Algorithm [T] makes no assumption on the order in which 
elements of S are processed, which corresponds to the way we explore the con¬ 
figuration space. In particular, if S is a first-in-first-out structure (a queue), the 
algorithm performs a BFS of the configuration space; if it is a last-in-first-out 
structure (a stack) then it performs a DFS. The execution time is essentially 
the same in all cases. The size of S may vary, but the space complexity of the 
algorithm is dominated by the size of M, that does not change. Still, the data 
structure impacts the order in which A-cliques are found. 

We illustrate this in the practical case where A = 3600 seconds (1 hour), 
see Figure El It shows that DFS rapidly discovers many cliques, and that those 
cliques are non-trivial cliques (cliques involving more than 2 nodes or lasting 
a substantial amount of time). In this case, using a DFS is therefore more 
interesting than a BFS, as it outputs results and exhibits non-trivial A-cliques 
faster. However, this behavior is dependent on the dataset, and deciding on the 
most appropriate exploration strategy in a given case remains an open question. 

Consider now Q = (V, £) the graph induced by link stream L = ( T,V,E ): 
V = V and £ = {(u,t>) : 3(t,u,v) £ E}. In other words, this is the graph 
where a link exists between two nodes u and v if and only if there is at least 
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Figure 5: Behavior of our algorithm depending on the way it explores its config¬ 
uration space (DFS or BFS). Left: number of maximal cliques discovered as a 
function of the number of iterations of the main loop of the algorithm. Right: 
maximal size of discovered cliques as a function of the number of iterations of 
the main loop of the algorithm. A clique size is estimated here by its number 
of nodes times its duration (in seconds). 


one contact between u and v in the link stream. The graph Q contains 1742 
cliques, the largest one involving 14 nodes. Approximately 70% of them involve 
students in the same class. 

If (X, [ b , ej) is a (maximal) A-clique of L, then by definition X is a clique of Q 
(in general not maximal). However, as A-cliques capture time information they 
shed light on different patterns. For instance, L contains a 60-clique involving 
4 students of different classrooms during roughly 5 minutes, which is likely to 
be the signature of a coffee break. Such A-cliques are non-trivial outputs of our 
algorithm, but they are invisible when considering graph cliques. 


4. Conclusion 


We introduced the notion of A-cliques in link streams, and proposed the 
first algorithm to compute the maximal such A-cliques. We implemented this 
algorithm and detected interesting A-cliques in real-world data. 

Clearly, our algorithm may be improved further. Trying to adapt the Bron- 
Kerbosch algorithm [l] and some of its variants Xj], 0 0 4], the most widely 
used algorithms for computing cliques in graphs, is particularly appealing. In¬ 
deed, the configuration spaces built by these algorithms are trees, which avoids 
redundant computations. This is achieved by maintaining a set of candidate 
nodes that may be added to previously discovered cliques, which does not di¬ 
rectly translate to our situation because of time in link streams. Still, we believe 
that progress is possible in this direction. 

We also consider the case of links with duration as a promising perspective: 
each link (6, e, u, v) means that u and v interact from time b to e. In this case 
there is no need for a A anymore, as density in this context is nothing but the 
probability that two randomly chosen nodes are linked together at a randomly 
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chosen time. The definition of cliques in link streams with durations follows 
directly, and our algorithm may be extended to compute maximal such cliques. 
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